Training Reward Visualization
Question Generation Environment - DIAL Task
How to use: Click on any point in the chart below to view example rollouts from that time in training. Each rollout shows the full multi-turn conversation between the questioner and boundary models.
📊
Click on a point in the chart above to view rollout examples